# Low-resource Language Processing

Kyrgyzbert
Apache-2.0
A small-scale language model based on the BERT architecture, specifically designed for Kyrgyz natural language processing applications.
Large Language Model Transformers Other
metinovadilet
79
2
Mt5 Large HuAMR
Apache-2.0
An Abstract Meaning Representation (AMR) parser based on google/mt5-large and fine-tuned on the Hungarian AMR dataset
Large Language Model Transformers Other
SZTAKI-HLT
33
1
Turkish Medical Question Answering
MIT
A fine-tuned BERT-based model for Turkish medical question answering that extracts answers from medical texts
Question Answering System Transformers Other
kaixkhazaki
20
1
Opus Mt Tc Bible Big Deu Eng Fra Por Spa Mul
Apache-2.0
A universal Transformer model supporting over 100 languages, suitable for various natural language processing tasks
Large Language Model Transformers Supports Multiple Languages
Helsinki-NLP
203
1
Bntqa Mbart
MIT
BnTQA-mBart is a low-resource Bengali table question answering model based on the mBART architecture, specifically designed for handling Bengali structured table data question answering tasks.
Question Answering System Other
vaishali
17
0
Bert Base Turkish Uncased Ner
MIT
Turkish named entity recognition model fine-tuned from dbmdz/bert-base-turkish-uncased
Sequence Labeling Transformers Other
saribasmetehan
54
5
Urdu Emotions Whisper Medium
Apache-2.0
Urdu emotion recognition model fine-tuned from Whisper-medium, achieving 91.67% accuracy on the evaluation set
Audio Classification Transformers
Pak-Speech-Processing
43
0
Gibberish Sentence Detection Model Tr
MIT
A BERT-based model fine-tuned to detect gibberish text (e.g., random character combinations) in Turkish.
Text Classification Transformers Other
TURKCELL
40
6
English To Urdu Translation Mbart
This is an mBART model fine-tuned for English-to-Urdu translation tasks, based on the facebook/mbart-large-50 architecture and trained on a custom dataset.
Machine Translation Transformers Supports Multiple Languages
abdulwaheed1
106
2
Nllb 200 3.3B Ct2 Int8
A multilingual processing model supporting over 100 languages and writing systems, covering from mainstream languages to various dialects and minority languages
Large Language Model Transformers Supports Multiple Languages
OpenNMT
65
5
Sentence Similarity Nepali
A Nepali sentence similarity model based on sentence-transformers that maps sentences and paragraphs into a 768-dimensional vector space.
Text Embedding Transformers Other
syubraj
18
3
Sinmt5
A summarization model based on the multilingual mT5 architecture, fine-tuned for Sinhala to generate abstractive summaries of CNN Daily Mail Sinhala news.
Text Generation Transformers
Hamza-Ziyard
14
0
Xlm Roberta Base Finetuned Urdu
Urdu text classification model based on XLM-RoBERTa architecture for binary sentiment analysis
Text Classification Transformers Other
hassan4830
57
8
Malayalam Summariser
Apache-2.0
A Malayalam news summarization model fine-tuned from google/mt5-small
Text Generation Transformers Other
akhisreelibra
60
0
Mbert Multiconer22 Bn
This model is designed for the Bengali track of the SemEval Multiconer task, focusing on natural language processing tasks such as Named Entity Recognition (NER).
Sequence Labeling Transformers
sumitrsch
39
2
Indic Bert Multiconer22 Bn
This is a model for the SemEval Multiconer task in the Bengali track, focusing on named entity recognition.
Sequence Labeling Transformers
sumitrsch
32
2
Ner Marathi Bert
Apache-2.0
Marathi named entity recognition model fine-tuned from bert-base-multilingual-cased
Sequence Labeling Transformers
lakshaywadhwa1993
15
0
Indicner
MIT
IndicNER is a model trained to recognize named entities in sentences of 11 Indian languages, fine-tuned from the bert-base-multilingual-uncased model.
Sequence Labeling Transformers Other
ai4bharat
45.85k
20
Hiner Original Muril Base Cased
Hindi Named Entity Recognition model based on MuRIL architecture, trained on the HiNER-original dataset
Sequence Labeling Transformers
cfilt
742
0
Bert2bert Indonesian Summarization
Apache-2.0
A BERT-base fine-tuned model for Indonesian text summarization, suitable for automatic summarization of Indonesian news articles
Text Generation Transformers Other
cahya
219
5
Robbert V2 Dutch Ner
MIT
RobBERT is the state-of-the-art Dutch BERT model, pretrained on a large scale and adaptable to various text tasks through fine-tuning.
Large Language Model Other
pdelobelle
76.94k
3
Xlm Roberta Base Finetuned Luganda Finetuned Ner Swahili
This is a named entity recognition model based on the XLM-RoBERTa model, fine-tuned on the Swahili portion of the MasakhaNER dataset.
Sequence Labeling Transformers Other
mbeukman
17
0
Codeswitch Hineng Ner Lince
MIT
This is a pre-trained named entity recognition model specifically designed for Hindi-English code-mixed data, trained on the LinCE dataset.
Sequence Labeling Supports Multiple Languages
sagorsarker
53
1
Cino Large V2
Apache-2.0
A multilingual pretrained model for Chinese and 7 minority languages in China
Large Language Model Transformers Supports Multiple Languages
hfl
110
11
Wav2vec2 Large Xlsr Gu
Apache-2.0
Gujarati automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving 23.55% WER on the OpenSLR dataset
Speech Recognition Other
gchhablani
3,582
0
Wav2vec2 Large Xlsr Bengali
A Bengali automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 and trained on the OpenSLR dataset.
Speech Recognition Transformers
tanmoyio
24.32k
3
Wav2vec2 Large Xls R 300m Marathi Cv8
Apache-2.0
An automatic speech recognition (ASR) model based on Facebook's wav2vec2-xls-r-300m model and fine-tuned on the Marathi speech dataset.
Speech Recognition Transformers Other
infinitejoy
443
1
Javanese Distilbert Small
MIT
A Javanese masked language model based on DistilBERT, trained on Javanese Wikipedia
Large Language Model Transformers Other
w11wo
22
0
Wav2vec2 Base Gujarati Demo
Apache-2.0
A Gujarati automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, with a test WER of 28.92%.
Speech Recognition
jaimin
25
1
Gpt2 Persian Question Answering
A generative question answering model for Persian, based on the GPT2 architecture.
Question Answering System Other
flax-community
25
4
Mt5 Base Wikinewssum Spanish
Apache-2.0
A Spanish summarization model fine-tuned from google/mt5-base that extracts key information from text to generate concise summaries
Text Generation Transformers
airKlizz
17
0
Sahajbert NER
Apache-2.0
A named entity recognition model based on sahajBERT and fine-tuned on Bengali datasets, capable of identifying entity types such as person, organization, and location names.
Sequence Labeling Transformers Other
neuropark
17
2
Opus Mt Fr Ig
Apache-2.0
This is a Transformer-based machine translation model for French (fr) to Igbo (ig), developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
20
0
Opus Mt Tw Fi
Apache-2.0
opus-mt-tw-fi is a machine translation model based on the transformer-align architecture, designed for translating Twi (tw), a language of Ghana, into Finnish (fi).
Machine Translation Transformers Other
Helsinki-NLP
63
0
Bert Khmer Small Uncased
This is a pretrained model specifically for the Khmer language, developed by the Tsinghua University team to support natural language processing tasks in Khmer.
Large Language Model Transformers
GKLMIP
19
0
Bert Khmer Base Uncased Tokenized
This is a pretrained model specifically for the Khmer language, developed by the Tsinghua University team.
Large Language Model Transformers
GKLMIP
22
1
Wav2vec2 Hausa2 Demo Colab
Apache-2.0
A Hausa speech recognition model based on facebook/wav2vec2-large-xlsr-53 and fine-tuned on the Common Voice dataset
Speech Recognition Transformers
Arnold
19
1
Opus Mt Fr Wls
Apache-2.0
opus-mt-fr-wls is a machine translation model based on the Transformer architecture, specifically designed for translating French (fr) to Wallisian (wls).
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
46
0
Opus Mt Fr Ase
Apache-2.0
A Transformer-based neural machine translation model for French to American Sign Language (ASE), developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
14
0
Opus Mt En Lun
Apache-2.0
A Transformer-based machine translation model for English to Lunda, developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
28
0